Generating Realignment-Based Communication for HPF Programs
نویسندگان
چکیده
This paper presents methods for generating communication on compiling HPF programs for distributed-memory machines. We introduce the concept of an iteration template corresponding to an iteration space. Our HPF compiler performs the loop iteration mapping through the twolevel mapping of the iteration template in the same way as the data mapping is performed in HPF. Making use of this unified mapping model of the data and the loops, communication for nonlocal accesses is handled based on datarealignment between the user-declared alignment and the optimal alignment, which ensures that only local accesses occur inside the loop. This strategy results in effective means of dealing with communication for arrays with undefined mapping, a simple manner for generating communication, and high portability of the HPF compiler. Experimental results on the NEC Cenju-3 distributed-memory machine demonstrate the effectiveness of our approach: the execution time of the compiler-generated program was within 10% of that of the hand-parallelized program.
منابع مشابه
HPF Library, Language and Compiler Support for Shadow Edges in Data Parallel Irregular Computations
On distributed memory architectures data parallel compilers emulate the global address space by distributing the data onto the processors according to the mapping directives of the user and by generating explicit inter-processor communication automatically. A shadow is additionally allocated local memory to keep on one processor also non-local values of the data that is accessed or deened by th...
متن کاملAn Expression-Rewriting Framework to Generate Communication Sets for HPF Programs with Block-Cyclic Distribution
In this paper, we present a new framework based on expression rewritings and a calculus form called CSD calculus to generate the local enumeration set and communication set for HPF programs with Block-Cyclic distribution. Our framework is a practical software framework, and can handle the general cases so that the communication set of HPF programs of “Block-Cyclic” distributions with two-level ...
متن کاملAn Expression-Rewriting Framework to Generic Communication Sets for HPF Programs with Block-Cyclic Distribution
In this paper, we present a new framework based on expression rewritings and a calculus form called CSD calculus to generate the local enumeration set and communication set for HPF programs with Block-Cyclic distribution. Our framework is a practical software framework, and can handle the general cases so that the communication set of HPF programs of \Block-Cyclic" distributions with two-level ...
متن کاملADDT: Automatic Data Distribution Tool for Porting Programs to PVM
An important issue faciing users who develop applications on PVM is that of partitioning the data over the different processors and generating communication statements for non-local dlata accesses. The burden of generating communicatio:n statements is alleviated by data parallel languages like HPF. But HPF requires the user to specify the distribution of the data in the application. This distri...
متن کاملAbout the Performance of HPF: Improving Runtime on the Cray T3E with Hardware Specific Properties
High Performance Fortran permits to write parallel programs with much less programming effort than by using standard communication libraries such as MPI or PVM. The performance of compiled HPF programs is considered low, though. We show that a compiled HPF application will gain a substantial runtime improvement if compilation incorporates properties of the hardware architecture into the final p...
متن کامل